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ABSTRACT 

The halo mass function, dn/dM, predicted by hierarchical clustering models can 
^ be measured indirectly using dynamical probes like the distribution of gravitational 

<D • lens image separations, dn/dA9, or halo circular velocities, dn/dvc- These dynamical 



o 
o 



variables depend on the halo structure as well as the halo mass. Since baryonic physics. 



. particularly cooling, significantly modifies the central density structure of dark matter 

halos, both observational distributions show a feature corresponding to the mass scale 
^ , below which the baryons in the halo can cool (i.e. galaxies versus clusters). We 

I use simplified but self-consistent models to show that the structural changes to the 

halos produced by the cooling baryons explain both distributions. Given a fixed halo 



cn 
m 

^SJ ■ mass function, matching the observed image separation distribution or local velocity 

' function depends largely on Qi, through its effects on the cooling time scales. These 

baryonic effects on the halo structure also affect the evolution of the velocity function 
, of galaxies with redshift. 

o . 

, Subject headings: cosmology: theory - galaxies: formation - gravitational lensing 

^ \ large-scale structure of universe - dark matter 

> 

«J ■ 1. Introduction 

P.. 

The number density, spatial distribution and properties of dark matter halos are now 



well understood in models based on hierarchical clustering (e.g. Jenkins et al. 2000; Sheth & 



Tormen 1999; Navarro, Frenk &; White 1996| ; Moore et al. |1998D , thanks in large part to numerical 



experiments involving high resolution N-body simulations. However, the relationship between 
these dark matter halos and astrophysical objects can be complicated by the modifications to the 
halos produced by baryonic physics and the dependence of our search and measurement methods 
on their baryonic properties. The dominant divide in the observed properties of halos is between 
galaxies and groups or clusters of galaxies. Physically this division is between halos in which the 
baryons have cooled and formed stars as compared to halos which have not (Silk |1977 ; Rees & 
Ostriker |l977| ; White & Rees |l97|; Blumenthal et al. [l98l . 



In this paper we explore how the halo mass function, dn/dM, is globally related to the 
masses or internal velocities of the objects we observe. By using dynamical probes of the halos 
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rather than luminosities we can try to avoid two problems. First, dynamical probes can provide a 
common measurement scale for both galaxies and clusters, allowing us to explore the location and 
properties of the boundary between the two types of astronomical objects. Second, by focusing on 
dynamical properties of the halos rather than luminosities, we partly avoid the issues related to 
star formation and feedback which are central to any relationship between the halo mass function 
and the luminosity function, dn/dL. 

We cannot, however, simply ignore the effects of the baryons, because the baryons significantly 
modify the halo structure when they cool (e.g Blumenthal et al. 1986 ; Mo, Mao & White |1998| ; 
Cole et al. pOO0| ; Gonzalez et al. ^000|) . Both massive galaxies and small groups of galaxies can 
have internal velocity dispersions ~ 200 km s~^, but it is difficult to infer from that fact the total, 
and presumably different, masses of the two systems. Thus, any global relationship between the 
halo mass function, dn/dM, and the observed properties of galaxies, groups and clusters should 
show a feature at the cooling mass scale Mc dividing cooled and uncooled halos. In this paper 
we quantitatively develop this theme by combining Press-Schechter ( |1974D based estimates of 
the mass function with adiabatic compression models (e.g. Blumenthal et al. 1986) to estimate 
the effects of the cooling baryons on the structure of the halo. This allows us to make relatively 
self-consistent estimates for the effects of the baryons in distorting the initial halo mass function 
into what our observational probes can measure. 

Many of the issues we will explore have been treated in earlier studies. That differences in 
cooling time scales create the division between galaxies and clusters is well known, and most of 
the effects we discuss are explicitly included in semi-analytic modelsQ of galaxy formation (e.g. 



Lacey & Silk pMl| ; White & Frenk pMl| ; Cole et al. p94| ; Baugh, Cole & Frenk [1996| ; Kauffman, 
White & Guiderdoni pJ93| ; Kauffman et al. |l99|; Somerville & Primack |l99|; Benson et al. pOOOj ; 



Cole et al. 200C). However our focus here is somewhat different and we highlight two issues which 
have not previously been emphasized. First, we focus on global probes of the mass function on 
both galaxy and cluster scales and the role of cooling in producing a measurable feature in these 
observational distributions. Second, we emphasize the importance of structural changes in the 
mass distribution of the dark matter halo rather than the effects of cooling on the star formation 
rate and luminosity /colors of galaxies which are a major focus of semi-analytic work. 

The outline of the paper is as follows. In g we review the modified Press-Schechter formalism 
which we use for determining the mass function of halos, adiabatic compression models for the 
effects of the cooling baryons on the structure of the halos, and simple cooling models for the 
determining the cooling mass scale. In §|3| we use these models to explore the distribution of image 
separations in gravitational lenses, and in ^ we illustrate the effects of the cooling baryons on the 
distribution of halos in dynamical velocities. In §^ we summarize our results. 



^ Since the cooling models adopted in the current generation of semi-analytic models are very similar, we will 
collectively refer to them as "SA" hereafter 
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2. The Properties of Halos 

Our models have three elements. First, we need to estimate the mass function of halos, 
dn/dM, and the formation epoch of each halo. Second, we need to estimate how the structure 
of the halo is modified by the cooling of the baryons. Third, we need to estimate whether the 
baryons in the halo have cooled. We will use a fixed cosmological model throughout the paper, 
the so-called "concordance" model of Ostriker & Steinhardt ( |1995| ) which provides an acceptable 
fit to a wide range of current observations. This is a ACDM model with a matter density (in units 
of the critical density) Qm = 0.3, a Hubble constant Hq = 67 km s~^ Mpc^^, and a baryon density 
Ofe = 0.04 normalized to match the abundance of rich clusters of galaxies {as = 0.9). 



2.1. The Halo Mass Function and Formation Time 



To calculate the cosmological mass function of dark matter halos we use a fit to the results of 
the Virgo simulations (Sheth & Tormen 1999; Jenkins et al. 2000| ) . We assign halos of mass M to 
peaks of height u 

where 5c = 1-69 and a{M) is the rms fiuctuation in the matter density smoothed with a top-hat 
filter on a scale = 3M/47rp with p the mean matter density. Press-Schechter (1974) theory 
postulates that the mass function can be cast in terms of z/ into a universal form 



M dn 
'JdM 



dM = f{u)dv 



where f{v) is known as the multiplicity function. We us^ (Sheth & Tormen 1999| ) 

uf{v) = A{1 + z.'-P)z.'V2e--'/2 



(2) 



(3) 



where p = 0.3 and v' = 0.707j^. The normalization constant A is fixed by the requirement that all 
of the mass lie in a given halo 

''f{u)du = l . (4) 



The Press-Schechter form is given by p — > and v' v. 

Later we will need to know the distribution of formation times for our dark matter halos. 
Following Kitayama &: Suto ( 1996| ) and Newman & Davis ( 20001 ) we model the formation time 



distribution using the extended Press-Schechter theory outlined in Lacey & Cole (1994). The 
formation time distribution, dp/dtform, is an integral over mass of the progenitors which merge 



We use this form rather than the fit from Jenkins et al. ( ^00C| ) because the range of vahdity of the latter is too 
small for our purposes. 
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to form a given halo at the observed redshift (Lacey & Cole 1994; Eq. 2.19). Because this 
integral can be difficult to evaluate numerically we in fact integrate the cumulative probability 
distribution and obtain the desired differential distribution by finite difference. Since all of the 
time dependence is in the threshold parameters 6{t) this can be cast into an elegant form and is 
very stable numerically. 



2.2. Adiabatic Compression 

The dark matter halo number densities calculated above represent "primordial" halos. To 
make contact with observations it is necessary to take into account the effects of baryons on halo 



structure and dynamics. We follow Mo et al. ( 1998 ) in estimating the modifications in the mass 



distribution of the halos created by the cooling baryons, though similar approaches are used by 
Dalcanton, Spergel & Summers ( |1997|) , Cole et al. (|2000D and Gonzalez et al. ( pOOOl) . 



We assume that the halos all have spherical profiles depending only on the mass and 
we neglect any substructure or halos-within-halos. Specifically we model the profile with the 
"universal" form|^ described by Navarro, Frenk & White ( 1996| ; hereafter NFW) 



x[l + x)^ 

where x = r jr^ is the radius measured in units of a characteristic scale r^. Each halo can then 
be characterized by two numbers. Rather than and the constant of proportionality in Eq. @, 
we take these to be the virial mass and the concentration. The virial mass M^-„ is the total mass 
inside the virial radius r^i^: the radius within which the mean density exceeds the critical density 
by a factor of Ac(z). In an Einstein-de Sitter model Ac = ISvr^ ~ 178. It is lower for < 1, 
taking the value Ac — 100 for the "concordance" cosmology. We estimated the concentration of 
the halos. 



rvir 9 / M,i 



1 + Z V8.12 X 10^2^/(7, 



-0.14 



(6) 

using the average relation found by Bullock et al. ( pOOOj ; see also Eke, Navarro & Steinmetz 2001) 



Physically this suggests that is roughly constant with redshift. Finally, the halo is assumed to 
have angular momentum J specified by its spin parameter A = J\E\^I'^ jGM^-^ ^ where the binding 
energy is computed using the virial theorem. 



vir 



We model the disk of the galaxy as an exponential disk characterized by mass = iridM^ 
and scale length r^. The disk is assumed to have angular momentum = j^J, and this is used 
to determine the disk scale length. If jg^ = m^, the specific angular momentum of the disk is the 
same as that of the halo. Unlike Mo et al. ( |1998| ), who considered adding a bulge component as 



^Our results will not depend sensitively on this choice. The alternate form of Moore et al. (1998) would work just 
as well. 
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a point mass, we added a bulge modeled as a Hernquist (1990) profile with mass Mh = iribM^ij- 
and a Hernquist scale length at,. In contrast to the disk, there is no conserved quantity like the 
angular momentum that can be used to determine the bulge scale length. Instead we simply 
used a phenomenological scaling^ that a = 0.045rrf. We assumed that the total specific angular 
momentum of the baryons was the same as the dark matter, but that all of the angular momentum 
is in the disk component {jd = + rrif,) while the bulge has none. 

The dark matter is adiabatically compressed by the cooled baryons, which for a spherical 
system of particles on circular orbits means conserving the mass and angular momentum 



(e.g. Blumenthal et al. |1986D . Thus, a dark matter particle initially orbiting at radius ends 



up at radius r where riMi{ri) = rMf{r) and Mi{ri) and Mf{r) are the initial and final mass 
distributions. The initial mass distribution is simply that of the NFW halo, M^vFiyl^i)) while the 
final mass distribution is 

Mf[r) = Md{r) + Mh{r) + (1 - - mi,)MNFw{r-i}. (7) 

After making an initial estimate for the disk scale length, Mf{r) is found by iteratively solving 
Eq. ^ while adjusting the disk and bulge scale lengths to satisfy the disk angular momentum 
constraint (see Mo et al. 1998) 



= j-^J = 2-K R dR Vc{R)R S(i?) (8) 
Jo 

where Vc{R) is the rotation curve in the disk and S(i?) is the surface mass density of the disk. The 
rotation curve Vc{R) is computed assuming that the bulge and the halo remain spherical while the 
disk is an infinitely thin exponential disk. 



2.3. A Very Simple Cooling Model 

We treat cooling using two very simple models. The first is simply adding a cooling mass 
scale Mc by hand. We assume that the probability that a halo has cooled is given by 



Pcoo\{M) = 1 



1 + exp 



log M - log ^ ^ 



(9) 



with a cooling mass scale and an (arbitrary!) logarithmic distribution width of AMc = 0.1 to 
smooth the transition. Half of the halos with M = Mc have cooled, and the total distribution of 
halos is the sum of the cooled and uncooled distributions. For any observational test we can vary 
the cooling mass scale Mc until we achieve a reasonable match. 



In the photometric survey of spiral galaxies by de Jong (1996), bulge effective radii are typically Re — rd/lO of 
the disk scale length r^, with a logarithmic scatter of 0.17 dex. For a Hernquist model, the scale length is a = 0.45-Re 
of a de Vaucouleurs effective radius R^, so we use a = 0.045rd. 
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The second model is based on a simplified version of the cooling model used by Cole et 
al. (pOOOD in their semi-analytic models of galaxy formation. We will use this model both to 



check whether our estimate of the cooling mass scale (§ 3.3.1| ) is reasonable and as a complete 



self-consistent model (§ 3.3.2| ). As we are interested only in the boundary between galaxies and 



clusters, we neglect the complicated details of star formation and feedback and consider only 
cooled mass fractions. We assume that when the halo collapses, the gas is heated to the virial 
temperature ksT^iT = ^^^p^vir where nip is the proton mass and fi is the mean molecular weight. 
We assume that the initial density distribution of the gas is pgas oc (r^ + ''c)"^ ^ < '^vir where 
fc = fs/^- This is the phenomenological model which Cole et al. (|2000 ) develop based on more 



realistic collapse simulations with hydrodynamics. Given the temperature, the gas density and 
the Sutherland & Dopita ( 1986| ) cooling curves, we can estimate the cooling time as a function of 
radius, 

T-cool(r) = ■ ^ (10) 

where we have fixed the metallicity to Z = Zq /3 for simplicity. 

As a consistency check, we can estimate the age of a halo and the time for a given fraction 
of the baryons to cool to see if they match at a mass scale comparable to our estimated cooling 
mass scale Mc (§ |3.3.1] ). We can also use the cooling model to determine the cooled baryon 
fraction of each halo. Given the average age, tfoj-^{M,z), for halos of mass M and redshift z, 
we calculate the cooled baryonic mass fraction fcoo\iM,z) by the mass fraction inside the radius 
where the cooling time equals the age, t{z) — tfo^^{M,z) = Tcooi(?'cooi)- If the global baryon 
fraction is + mf,)o, then we model the halo with an adiabatic compression model having a 
baryon fraction + = f cooi{M, z){m(i + m;,)o, eliminating the cooling mass scale Mc as a 
parameter. Assuming that all halos start as fair samples of the universe, the global baryon fraction 
(m^ + mf,)o = r^b/r^m- Naively then, the only parameter of the model is the baryon density Qf,. 
In practice however, star formation complicates the interpretation. While controls the initial 
cooling of the halo, star formation can reheat the baryons once they have cooled so that the cool 
baryon fraction in a galaxy is less than the fraction which cooled initially (see e.g. SA). Since the 
adiabatic compression is due only to the cold baryons, we will call our parameter r^fe^cooi and we 
should find that f^ft cool 

< Oft. 



3. The Distribution of Gravitational Lens Image Separations 

The distribution of image separations A9 in gravitational lenses is directly related to the 
redshift-averaged distribution of halos dn/dM, where the conversion depends on the halo mass, 
redshift, and internal structure. Given a large well-defined lens sample, the image separation 
distribution will be the cleanest global probe of the halo mass function because the selection of 
the lenses is independent of the flux or surface brightness of the baryons in the halo. Lens surveys 
also avoid the distinction between low-mass (galaxy) and high-mass (cluster) systems required 
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for any other global estimate of the mass function. Wide separation lenses (usually defined 
by A9 > 3'/0) are created by groups and clusters of galaxies and small separation systems are 
created by individual galaxies. We will show, however, that a successful model of the separation 
distribution must correctly include both the mass scale below which the baryons significantly 
modify the structure of the halo and the changes in the structure of the halos produced by the 
cooled baryons. 

We will examine the distribution of image separations of the lenses found in the Cosmic 



Lens All-Sky Survey (CLASS; e.g. Browne & Myers 2000 ) for lensed flat-spectrum radio sources. 
The primary CLASS survey has a nearly uniform selection function for separations from 
O'.'S < A6I < e'.'O, with extensions to wider separations of 6'.'0 < A6l < 15'.'0 (Philhps et al. |200C| ). 



These surveys have found 18 lenses, all with separations A9 < 3'.'0. We also considered the 
separation distribution of all radio-selected lenses. This sample of 27 lenses is inhomogeneously 
selected, but includes two wider separation lenses (MG 2016+112 and Q 0957+561) and illustrates 
the uncertainties in the tail of the distribution. We include the angular selection function for small 
separation lenses, but with an outer limit of 15'.'0 we will be able to ignore the selection function 
for large separation lenses. 



3.1. The Need for Baryons 



The need for baryonic processing to explain the image separation distribution is most obvious 
when we consider the previous attempts to compute the distribution of lens separations based only 



on the properties of the dark matter (e.g. Narayan & White 1988 ; Kochanek 1995 ; Wambsganss 
et al. |1995| ; Wambsganss, Cen & Ostriker |1998| ; Maoz et al. |1997] ; Keeton |l99j ; Mortlock k 
Webster pOOQ ; Li & Ostriker pOOQ , Keeton & Madau |2000| , Wyithe, Turner & Spergel pOOQ ). In 
these models, most of which were intended only to explain the wide separation lenses, the mass 
function of the parent dark matter halos is calculated using the Press-Schechter ( |1974 ) theory and 
its extensions. Given a mass function, the lens properties are then calculated by assuming a model 
for the density distribution of the halos. When the models are normalized so that they predict the 
correct local abundance of massive clusters, they correctly predict that wide separation lenses are 
rare. However, they catastrophically fail to explain the distribution of smaller separation lenses. 

A purely phenomenological approach based on the local properties of galaxies, by contrast. 



predicts the observed properties of lenses quite well (e.g. Kochanek 1996, Keeton, Kochanek & 
Falco 1998| ). These models usually combine local galaxy luminosity functions with local kinematic 
relations to predict the distribution of lenses assuming a constant comoving density of galaxies, 
although a few studies have considered the effects of number evolution and merging (e.g. Mao 



& Kochanek 1994; Rix et al. 1994). These models have modest difficulty explaining the largest 
separation lenses observed in systematic surveys (separations of A9 ~ 6'.'0), and cannot explain 
the lensing effects of rich clusters. 
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Keeton ( |1998 ) pointed out that the origin of the problem lay in ignoring the extra physics 
which makes the density structure of the lenses depend strongly on the mass scale. Any model 
based on the mass function of dark matter halos which assumes that the density distributions 
of the halos vary smoothly and continuously with mass leads to predictions for the separation 
distribution that include far too many wide separation lenses compared to small separation lenses. 
Keeton (|1998| ) demonstrated the effect by showing that using either SIS (singular isothermal 
spheres) or NFW density profiles for all lenses could not explain the observations. Only by 
introducing a baryonic cooling mass scale Mc below which the halos were modeled as SIS lenses 
and above which they were modeled as NFW halos could the observed properties be explained. 
In particular this change in structure explains why many lenses found in groups of galaxies were 
associated with the galaxies in the group rather than the group halo, even though the group halo 
had to be more massive than its component galaxies. 

Porciani & Madau ( |200C| ) used this hypothesis in their models for the separation distribution 
of lenses and found that the mass scale below which the halos had to cool and have the SIS 
density profiles was Mc = 3.5 x IO^^Mq. We shall find a similar cooling mass scale, though our 
model is slightly different than that of Keeton ( |1998| ) and Porciani & Madau ( pOO0| ) . We also show 
that such a mass scale is naturally predicted by the properties of halos in hierarchical structure 
formation models if the baryon density is near the value preferred by big-bang nucleosynthesis 
(e.g. O'Meara et al. |200lD. 



3.2. The Qualitative Effects of Baryons on Lensing Properties 

We start by illustrating the effects of the baryons on the lensing properties of a Mvir = lO^^M© 
halo at redshift zi = 0.5 (implying a concentration of c = 8), spin parameter A = 0.04, baryonic 
mass fraction + nih = 0.05, and baryonic angular momentum fraction = 0.05. Fig. || shows 
the rotation curves for the initial dark matter distribution and the final matter distribution given 
bulge-to-disk mass ratios of mfe/m^ = 0.0, 0.10 and 0.20. We see the familiar boost in the circular 
velocity due to the disk and the compression of the dark matter (e.g. SA). The bulge supports 
the central rotation velocity and slightly reduces the outer rotation velocity because the angular 
momentum per unit mass in the disk is rising as we increase the mass of the bulge. It is instructive 
to note that the mass profile, angular momentum profile and rotation curve of our compressed 
halos are quite different from what a pure dark matter simulation would predict. In particular the 
rotation curve only "recovers" to the pure dark matter form at very large radius (not shown on 
Fig. i. 

In Fig. I we also show the deflection or bending angle profile a{x) produced by the lens 
assuming it lies at zi = 0.5 and the source lies at Zg = 2. To simplify the problem we consider only 
the face-on lensing properties, but we should keep in mind that systems with flat disks are more 



efficient lenses when inclined (see Keeton &: Kochanek 1998). With these parameters, the initial 



dark matter distribution is a sub-critical lens and can produce no multiple images of the source. 
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Fig. 1. — (Left) The rotation curves for IO^'^^Mq halos at zi = 0.5 with concentration c = 8. The 
dashed curve shows the rotation curve of the initial NFW halo. The solid curves show the rotation 
curves found after the compression of the halo by the baryons assuming a baryonic mass fraction 
of rud + mb = 0.05, a spin parameter A = 0.04, and that the disk contains all the initial baryonic 
angular momentum, = 0.05. The three solid curves are for bulge-to-disk mass ratios nHf/md = 
(highest peak Vc), 0.1 and 0.2 (lowest peak Vc) respectively. The vertical line shows the disk scale 
length. (Right) The bending angles, a(x), produced by the same halos assuming a source redshift 
of Zs = 2.0. The dashed curve shows the deflection produced by the initial NFW halo. The solid 
curves show the bending angles produced after the compression of the halo by the baryons where 
the central deflection profile becomes steeper as we increase the bulge fraction. For comparison, the 
horizontal dashed lines show three simple comparison models: an SIS lens normalized by the halo 
virial velocity v^r (NFW/SIS, Vmr)-, an SIS lens normalized by the peak circular velocity of the 
compressed halo, (AC/SIS, 

^c,max)) &iid a Mestel disk lens normalized by the peak circular velocity 
of the compressed halo, (AC/Mestel, Uc,max)- The tangential critical line x+ of a lens (the Einstein 
ring) is located at the point where the (dashed) 45° line intersects the bending angle, and the radial 
critical line X- is located where a 45° line is tangent to the bending angle. An arrow points to the 
location of the tangential critical line of the adiabatically compressed models. The vertical solid 
line shows the disk scale length. 
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Adding the disk and compressing the dark matter pushes the lens above critical, although not by 
a large amount. The radial and tangential critical radii are comparable, and the lens will generally 
produce visible central (odd) images which are almost never seen in real systems. 

The problem is that exponential disks have little density contrast between their centers and 
the disk scale length (~ 4/i~^ kpc) while an efficient lens requires a large density contrast over this 
region (for a recent discussion, see Rusin &: Ma 2001| ). The sensitivity of the lensing properties to 



the central density means that adding even a small bulge enormously increases the efficiency of 
the galaxy as a lens. For bulge-to-disk mass ratios of rrnj/rrid = 0.05, 0.10, and 0.20 the multiple 
imaging cross section increases by a factors of 6, 12 and 23 compared to the model with no bulge. 
Moreover, the models with bulges will generally have demagnified central images, as observed in 
real lenses. In short, the details of the central density distribution are critical to the statistical 
properties of the lens, and our adiabatic compression models are inadequate for providing detailed 
predictions of the central density distribution. 

The adiabatically compressed models are complicated, so it is of some interest to see how 
they compare to the far simpler SIS and face-on Mestel disk lens models. Both models have flat 
rotation curves and constant deflections equal to half the image separations, but if normalized to 
the same circular velocity they produce different image separations of AOsis = ^'^{vc/c)'^ Dls / Dps 
and /\6 Mestel = 8{vc/c)'^Dls/Dos because a thin disk requires less mass than a spherical 



distribution to produce the same circular velocity (see Keeton & Kochanek 1998| ). Because the 
inner rotation curves of the adiabatically compressed models are dominated by the disks, a Mestel 
model normalized to the peak circular velocity provides a better match to the deflections of the 
adiabatically compressed models than an SIS model. Using an SIS model with a circular velocity 
equal to the initial halo virial velocity, Vmr = {GMmr / RvirY^'^ ■, also provides a good match to the 
deflection scale for this model. Since most of the known lenses are produced by bulge-dominated, 
early-type galaxies (see Keeton et al. |1998D and seem to require mass distributions very similar 



to the SIS model (e.g. Cohn et al. 200 1| ), it is interesting to note that the shape of the deflection 



profile becomes more similar to that of the SIS model as we increase the bulge mass fraction. 



3.3. The Image Separation Distribution and the Density of Baryons 

We explore the role of the baryons in producing the observed separation distribution of 
gravitational lenses in four stages. First, we illustrate the problem created by ignoring the baryons. 
Second, we show that introducing a mass scale, Mc, below which the halos are compressed 
explains the observed distribution. Third, we show that the required mass scale is consistent with 
simple models of cooling physics. Fourth, we show that the fundamental variable controlling the 
separation distribution is the cosmological density of cooled baryons. Some technical details of our 
calculations are relegated to Appendix 
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Fig. 2. — Predicted separation distributions without a cooling scale. The curves show the integral 
separation distribution normalized to the total number of CLASS lenses. The heavy solid line 
shows the distribution predicted by pure NFW models while the light solid (dashed) lines shows 
the distributions predicted by the adiabatic compression models with no bulge (a 10% baryonic 
mass fraction bulge). The wiggles in the pure NFW curve are a small discretization problem. 
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3.3.1. The Cooling Scale 

The first two stages reproduce the arguments originally developed by Keeton ( |199^ ) and 
Porciani k Madau ( pOOOD - Consider, standard model, a set of halos with rud + mb = 0.05, 



A = 0.04, and jd = 0.05 with {rrih/md = 0.1) and without {mi/rrid = 0) a bulge. If we ignore the 
existence of a characteristic mass scale dividing galaxies from groups and clusters, then we predict 
image separation distributions with far more wide separation lenses than are actually observed. 
The problem does not depend on the choice of model. Keeton ( |199^ ) demonstrated it using NFW 
and SIS models for the lenses and we illustrate it in Fig. |2| for NFW models and adiabatically 
compressed models both with and without a central bulge. While the low mass halos have higher 
NFW concentration parameters, which makes them more efficient lenses, the differences are not 
large enough to produce a distribution cutting off sharply at IS.6 ~ 3'.'0. 

The solution is to introduce an abrupt change in the structure of the objects at the mass 
scale Mc dividing galaxies and clusters. Porciani & Madau ( |2000 ) used SIS models normalized 



by the local properties of galaxies below Mc and NFW models above Mc and could fit the 
observed separation distribution given a mass scale of Mc = 3 x 10^^ Af©. We use the adiabatically 
compressed models below Mc and the NFW models above Mc. In our models, Mc is simply the 
mass at which 50% of halos have cooled based on the probability distribution defined by Eq. @ . 
The compressed halos have far higher cross sections per unit mass than the original NFW halos 
(see j ^3.2| ), leading to an increase in the fraction of small separation lenses. Fig. ^ shows the 
predicted distributions as a function of Mc. 

If the cooling mass scale is too large, Mc ^ 3 x IQ^'^Mq, then we find the distribution 
predicted by the adiabatically compressed models without a cooling scale (Fig. ^) and cannot 
match the observations. If the cooling mass scale is too small, Mc ^ lO^^M©, then we find the 
distribution predicted by the NFW models for large separations combined with a sharp peak at 
small separations. Only if the break is at a mass scale Mc ~ lO^^M© can the models reproduce 
the observed distribution. Interestingly, cosmological hydrodynamic simulations also find that the 



cooled baryon fraction reaches 50% on mass scales near Mc ~ 10^'^ M0 (e.g. Pearce et al. |1999D 



We can quantify the goodness of fit by using the Kolmogorov-Smirnov test (e.g. Press et 



al. 1992 ) to compute the likelihood Pks that the model produces a separation distribution 
consistent with the observations. Fig. ^ shows Pks as a function of Mc for a range of models. For 
a fixed baryonic fraction (m^ + m;,), the optimal value depends on the assumed structure of the 
low mass halos. For example, in our models Mc increases from Mc — 5 x 10^^ to Mc — W^^Mq 
when we add a rrib/rad = 0.1 bulge. By increasing the lensing efficiency of the low mass galaxies 
relative to the high mass galaxies (by making them more supercritical lenses), the addition of the 
bulge drives the estimate of the cooling mass scale upwards. An SIS model would further increase 
the relative lensing efficiency of the low mass galaxies, which probably explains why Porciani & 
Madau (|2000D found a stih higher break mass: Mc ~ 3 x lO^^M©. 



The optimal mass scale depends on the baryon fraction, nid + mj,. With less baryons, a halo of 
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Fig. 3. — Predicted separation distributions with a cooling scale for models with a mi/rrid = 0.1 
bulge (right) and without a bulge (left). The dashed curves show the distributions for = 
IO^^Mq, 3 X IQ^'^Mq and lO^^M© while the solid curves show the distributions for 3 x 10^^ and 
10^^ Mq 3 X 10^^ M0 and lO^^M©. The curves are labeled by log Mc. The heavy solid (dashed) 
curve shows the observed distribution of the CLASS (radio-selected) lenses. 
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Fig. 4. — The Kolmogorov-Smirnov probability, Pks, of fitting the observed separation distribution 
of CLASS lenses as a function of the cooling mass scale Mc- The heavy (light) solid curves 
indicated by the arrow show the K-S probability for models with + = 0.05 without (with) 
a m,[,/m,(i = 0.10 bulge. The heavy dashed curves show the K-S probabilities for models with lower 
{mb + nid = 0.01 and 0.02) or higher {mb + nid = 0.10 and 0.20) baryon fractions where the optimal 
cooling mass decreases as the baryon fraction rises. The light dashed curves show the cooling time 
in units of 10 Gyr for the radii enclosing 50% of the baryonic mass for the standard model. The light 
solid line shows the time since the average formation epoch ((iform)) in units of 10 Gyr assuming 
h = 0.67. 
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a given mass becomes a less efficient lens producing smaller image separations, so Mc must increase 
to keep the observed break at a fixed image separation scale. If the baryon fraction becomes too 
low to significantly compress the halos (m^ + rrih ^ 0.01), it becomes impossible to explain the 
observations. The mass scale required to explain the observations depends exponentially on the 
baryon fraction, with logiQ Mc/Mq ~ 13.6 — (m^ + mfe)/0.15 for the models without a bulge. 
These trends are also shown in Fig. ^ 



3.3.2. A Self- Consistent Model 



These models, and the earlier models by Keeton (1998) and Porciani & Madau ( 2000| ), 



introduced the break mass as an ad hoc means of separating galaxies and clusters into separate 
lens populations. Physically this break mass is the cooling mass scale, which divides halos in which 
the baryons have cooled from those in which they have not, and its value should be predictable 
from the basic physics of cooling. 

As a first step we computed the cooling times at z = for the radius encompassing 50% of 
the baryons in halos with + rub = 0.05, and compared it to the time elapsed since the average 
formation time of the halos. These time scales are superposed on Fig. |^. As expected, the cooling 
times are comparable to the time available for cooling near the mass scales required to explain 
the distribution of image separations. Note, however, that agreement between the cooling mass 
scale and the break mass required to explain the lenses depends strongly on the baryon fraction 
rud + nib. The two scales agree for our fiducial model with + nib = 0.05. For higher baryon 
fractions the break mass required to explain the lenses is well below the cooling mass scale, and for 
lower baryon fractions it is well above the cooling mass scale. Fig. Q underestimates the problem 
because the cooling time also depends on the baryon fraction as TcqoI oc p~g^. The cooling time 
drops when we increase the baryon fraction, which will exacerbate the discrepancies between the 
cooling mass and the break mass needed to explain the lenses. Thus, cooling physics makes the 
baryonic mass fraction, + nib, of the cooled halos the key parameter relevant to determining 
the distribution of image separations. 

In our final models the only parameter is the halo baryon fraction (m^ + mb)o which should 
equal i^b/^m for halos which are a fair sample of the universe. The cooled baryon fraction 
is then determined from our simple model of the cooling physics from §2.3. Figure ^ shows 
the separation distributions computed using our cooling model as a function fife. cool i where the 
structure of each halo is set by the adiabatically compressed models with a cooled baryon fraction 
of TTT-b + nid = fcooii^, z){mb + m^jQ and {nib + "^(i)o = ^b.cooi/^m- The qualitative behavior 
of the models is similar to the more phenomenological models based on the cooling mass scale 
Mc. Low rift^cooi models have difficulty cooling, making them equivalent to models with a low 
cooling mass scale. High rifc.cooi models cool easily, making them equivalent to models with a 
high cooling mass scale. Models with 0.015 ^ f^fexooi ^ 0.025 agree with the observed separation 
distributions independent of which data we fit (CLASS lenses or all radio lenses) or the assumed 
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Fig. 5. — Image separation distributions as a function of ^b,coo\ using models with a mh/rrid = 0.1 
bulge (right) or without a bulge (left). The dashed curves show the distributions for rib,cooi = 0.003, 
0.006, and 0.009 (from right to left at large separation), and the solid curves show the distributions 
for J^bxooi = 0.0012, 0.015, 0.018, 0.021, 0.024, 0.030, 0.045 and 0.060 (from left to right at large 
separation). The heavy solid (dashed) show the observed separation distribution of the CLASS (all 
radio) lenses. 
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Fig. 6. — Kolmogorov-Smirnov test probability of fitting the separation distribution of CLASS 
lenses as a function of ^lb,cooi- The squares (triangles) indicate models with no bulge (with a 
rrib/md = 0.1 bulge), and the solid (dashed) lines correspond to fitting the CLASS lenses (all radio 
lenses). The point with horizontal error bar is the estimate by Fukugita, Hogan & Peebles ( |199^ ) 
for the cold baryon (stars, remnants, cold gas) content of galaxies. The vertical line marks the total 
baryon content in the concordance model. 
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density structure (with or without a rrib/md = 0.1 bulge). The hkeUhoods as a function of r^b^cooi 
are shown in Fig. ^. The results are insensitive to modest errors in the cooling function, as raising 
and lowering the cooling curve by factors of two only affects the estimated baryon fraction by 
20%. With the introduction of the cooling physics, there is no trivial scaling of the results for the 
value of the Hubble constant. 

While the preferred range is less than the total baryon density ^Ib = 0.04 in the input 
cosmology, it significantly exceeds the estimates of 0.0045 ^ f^fe,cooi ^ 0.0068 for the cool 
baryon fraction (stars, cold gas and stellar remnants) in local galaxies by Fukugita, Hogan & 



Peebles ( 1998 ). This discrepancy could have two explanations. First, it could be a problem in 
our models. The adiabatic compression models are crude approximations for the transformation 
of the dark matter halos by the baryons. While the estimates of 1^6, cool appear to be insensitive 
to changes in our assumptions, we know that the observed lens population is dominated by 
early-type galaxies whose baryonic density structure is very different from the assumptions used 
in our models. It is difficult to adequately address this possibility, since it is currently impossible 
to compute the final structure of a galaxy starting from the initial halo properties. Second, 
the accounting for the baryons in galactic halos may be incorrect. The Fukugita et al. ( [1998| ) 
accounting for the bayrons in galaxies included only cold gas components, neglecting hot (10^ K) 
and warm, ionized (10^-10^ K) components. While hot gas cannot contribute the adiabatic 
compression of the halo, the warm components are both difficult to detect and contribute to the 
compression. Perhaps our best route towards understanding the detailed physical modifications of 
the dark matter halos by baryonic processes in hierarchical models is through simulations, but at 



present these have proven extremely difficult (see e.g. Navarro & Steinmetz 2000| ). 



The Distribution of Circular Velocities 



Given our success in fitting the distributions of gravitational lens image separations, it is 
natural to ask whether our model agrees with estimates of the local velocity function. Global 
estimates of the local (circular) velocity function, dn/dlogVc, are difficult because galaxies and 
clusters have very different dynamical properties and require different observational methods. We 
first construct a rough estimate of the local velocity function including both galaxies and clusters, 
and then compare it to our best self-consistent models. We stress that most of the pieces in our 



model have been addressed before by other authors (e.g. Turner, Ostriker & Gott |1984|; Cole 
& Kaiser |1989| ; Fukugita & Turner |1991j ; Shimasaku |1993| ; Kochanek |1995| , |1996| ; Gonzalez et 



al. 200C ; Sigad et al. 2001) here we attempt to focus on the combination of cluster and galaxy 
scales to emphasize the effect of cooling in producing a change in the velocity function. 



We will use the velocity function for local galaxies derived by Pahre, Kochanek & Falco ( |2001| ), 
which has significantly reduced systematic uncertainties compared to earlier estimates. The 
velocity function for groups and clusters is notoriously difficult to estimate. We will combine the 
X-ray temperature function estimate by Blanchard et al. ( |2000 ) with the average X-ray relation 
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Fig. 7. — The velocity function dn/dlogVc = (dn/(ilogM)|(ilogM/(ilog?;c|. The solid curves show 

the local velocity function of galaxies (low Vdrc) and clusters (high Vc) and their sum. The points are 
the non-parametric velocity function of galaxies. From bottom to top, the dashed curves show the 
velocity functions derived using dn/dM and the NFW virial velocity (labeled NFW fyir), the peak 
circular velocity of the NFW rotation curve (labeled NFW Uc,max) and the peak circular velocity 
of the adiabatically compressed model (labeled cooled Ucmax)- We used the fi6,cooi = 0.018 model 
with no bulge. 
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Fig. 8. — The global relation between mass and circular velocity at redshifts zero (solid) and unity 
(dashed). The heavy curves show the peak circular velocity of the NFW model. The light curves 
show the peak circular velocity including the baryonic cooling and adiabatic compression from the 
^^fe,cooi = 0.018 model. The upper light curve is the model with no bulge component {rrib/md = 0) 
and the lower light curve is the model with a bulge {rrib/md = 0.1). The bulge slightly reduces 
the peak rotation velocity (because it increases the angular momentum per unit mass of the disk) 
while making the rotation curve flatter (see § |3.2| ). 
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Fig. 9. — The velocity function at redshift zero (solid) and unity (dashed). The heavy curves show 
the results for the dark matter model. The light curves show velocity functions for halos in which 
at least 1/3 of the baryons have cooled in the ^b,coo\ = 0.018 model. The upper light curve is 

the model with no bulge component {m,h/mj = 0) and the lower light curve is the model with a 
bulge {mh/md = 0.1). Superposed on the distributions are the points from the velocity function of 
galaxies at z = (see text). 
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between the X-ray temperature and the galaxy velocity dispersion derived by Wu et al. ( 1999 , eqn. 
(14)) to estimate the velocity function for groups and clusters. This velocity function estimate 
is nearly identical to a simple thermodynamic conversion using Vc = {2kT / fimp)^/"^ to estimate 
the circular velocity. There are non-trivial systematic uncertainties in our final result because of 
the problems in uniformly relating late-type galaxy rotation curves, early-type galaxy velocity 
dispersions and X-ray temperatures to a common circular velocity scale. While the upper velocity 
limits of galaxies are unambiguous, the lower velocity limits for groups are unknown because of the 
difficulty in unambiguously recognizing and counting low mass groups. For simplicity we simply 
truncate the cluster contribution at temperatures below 0.5 keV, which introduces a small kink in 
the distribution. Fig. |^ shows the estimated velocity function, which has a prominent break near 
Vc ~ 400 km s~^ at the boundary between the high-mass galaxies and the low-mass groups. This 
break in the velocity function is the local signature of the cooling mass scale, and its amplitude is 
far larger than any systematic uncertainties in our construction of the velocity function. 

We can convert the halo mass function into the velocity function given a model relating the 
mass M to the circular velocity Vc- Ignoring the baryons we could use either the virial velocity of 
the NFW halo, v^h = (GMvir/rvir)^^^ or the peak circular velocity Uc,max of the rotation curve of 
the halo. A variable transformation, dn/d\ogVc = {dn/dlogM)\dlogM/dlogVc\ relates the mass 
function to the velocity function. Fig. ^ shows the predictions for the velocity function, and Fig. ^ 
shows the relationship between M and Vc for the two cases. As expected for a model normalized 
to match the abundance of rich clusters, the dark matter model produces a reasonable match to 
the velocity function of clusters given the uncertainties in the construction of the velocity function. 
The dark matter model catastrophically fails to match the velocity function of galaxies, grossly 
underpredicting the density for Vc ~ 200 km s~^ and grossly overpredicting it for Vc ^ 100 km s~^. 
These discrepancies are created by the effects of the baryons in distorting the halo structure (see 
also Gonzalez et al. 2000] ). 



The break in the velocity function should be reproduced by our model for the distribution of 



image separations. Fig. g shows the relationship between Vc and M for the model from §3.2 with 
^b,cooi = 0.018 which provided a good fit to the distribution of lens separations. For large masses, 
where the baryons have not cooled, the peak circular velocity matches that of the dark matter 
models. For small masses, all the baryons have cooled and the circular velocity curve is shifted 
upwards. Near M ~ W^^Mq there is a break in the curve between the uncompressed and fully 
compressed slopes. The model velocity function (Fig. |^ now has a break at the same velocity scale 
as the observed velocity function. The adiabatic compression shifts the more numerous low-mass 
halos to a higher circular velocity, bringing the density of halos with Vc ~ 200 km s^^ close to 
that observed. The models still overpredict the density of very low mass halos, Vc ^ 100 km s^^ 



probably because star formation reheats and/or disrupts low mass halos (see Gonzalez et al. 2000). 
These low-mass halos had no impact on the distribution of lensed image separations because 
the ~ Vc scaling of the lensing cross sections and the finite angular resolution of the surveys 
(AO ~ 0'.'25) makes the lens surveys insensitive to these mass scales. To fix this problem we would 
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need to adopt a similar scheme to that used in the semi-analytic models (see, e.g., SA) to fix the 
overabundance of low-luminosity halos: appeal to feedback to suppress star formation in these 
"galaxies" . 

While the break in the velocity function consistent with the distribution of lens separations 
is located on the right velocity scale, the agreement is not perfect, principally because our model 
produces a peak in the velocity function at approximately Vc — 500 km s~^. No similar peak 
is seen in the distribution of image separations because the separation distribution is smoothed 
by the redshift distribution of the lenses. The peak is created (mathematically) by the region 
near M = W^^Mq where the Vc-M relation is flat. The flat region of the the Vc-M relation is 
produced by two problems in our model. The first is the "over-cooling" problem common to many 
semi-analytic models (see SA) including the Cole et al. ( pOOOj ) model we used. The cooled mass 
and the resulting star formation predicted for halos on group mass scales are too large, leading 
to super-luminous galaxies which are not observed. More complicated models can significantly 
reduce the problem (see Cole et al. 2000| ) . The second problem is that we assumed a deterministic 



relationship between mass and circular velocity by using an average formation time rather than a 
distribution of formation times for each halo. More complicated models which use a distribution 
of formation times (e.g. Cole et al. ^OOOj ; Newman & Davis ^OOOj ) would smear out the feature. 



Newman Sz Davis ( |2000 ) proposed using the evolution of the velocity function of galaxies as 



a probe of the cosmological model. Our experience here suggests that uncertainties in baryonic 
physics relating the observed galaxy velocities to those of the "underlying" dark matter halo 
could impact this proposal. Fig. ^ shows the Vc-M relation both today and at redshift unity. At 
fixed halo mass, the circular velocity of the dark matter models increases with look back time 
because the average density of the halos increases (Navarro, Frenk & White [l996| , |l997|) . With 



the inclusion of the baryons, the evolution of the Vc-M relation is considerably more complicated. 
First, because the time available for cooling is significantly less at redshift unity than today, 
the cooling mass scale evolves with redshift. In these models the cooling mass scale today is 
M ~ 6 X IO^^Mq, while at redshift unity it is M ~ 3 x lO^^M©, based on the point where the 
Vc-M relation begins to return to the dark matter relation. Because the cooling mass determines 
the mass of the most massive and luminous galaxies, magnitude limited studies of galaxies will be 
very sensitive to the evolution of the cooling scale because they are dominated by galaxies near the 
characteristic luminosity created by the existence of a cooling mass scale. Second, at lower masses 
where we might avoid problems due to the evolution of the cooling mass scale, the evolution in 
the circular velocity at fixed mass differs between the dark matter and adiabatically compressed 
models. The adiabatic compression modestly reduces the change in the circular velocity at fixed 
mass compared to the dark matter models. Third, the relationship between mass and circular 
velocity depends on the detailed of the distribution of the baryons. The models with a bulge 
have somewhat lower peak rotation velocities than the models without because of our choice for 



distributing the baryonic angular momentum (see § |2.2| and § p.2| ) 



Fig. shows the evolution of the velocity function based on these different Vc-M relations. 
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At fixed peak circular velocity, the number density of halos increases between today and redshift 
unity by about 25%. We also show the evolution based on the peak circular velocity distribution 
of halos in which at least 1/3 of the baryons have cooled. The number density at fixed circular 
velocity still increases, but the ratio is slightly smaller. We see that the choice of density model 
has reasonably strong effects, and the behavior of the velocities near the cooling scale becomes 
more complicated. In fact the differences between the models with and without a 10% bulge are 
equal to the differences created by evolution. This means that the details of the baryonic mass 
distribution, and the evolution of the baryonic mass distribution are at least as important as 
cosmology to the evolution of the velocity function of galaxies. For example, in most models of 
galaxy formation, the distribution of bulge parameters is itself an evolving quantity, if nothing else 
because the relative numbers of late-type disk galaxies and early-type, bulge-dominated galaxies 
evolves (see SA; for observational evidence, see, e.g. van Dokkum & Pranx p001| ). We have also 
superposed the Pahre et al. ( |2001| ) estimate of the velocity function to emphasize that our theory, 
even with the baryons, shows far larger differences in shape from the model predictions than the 
magnitude of the evolutionary effects. For these reasons we believe that the evolution of the 
velocity function with redshift will most likely provide crucial information about the process of 
galaxy formation, before it can be used to discriminate between different models for the evolution 
of the expansion rate. 



5. Conclusions 

The mass and velocity distributions of virialized objects include a feature, the baryonic cooling 
scale Mc, which divides halos which host galaxies from those which host groups and clusters. 
The cooling not only produces differences in morphology, but also alters the density distributions 
and the dynamical structure of the halos. This significantly influences the relationship between 
observable kinematic properties of galaxies and the properties of their "primordial" parent halo. 
These baryonic effects can go a long way towards reconciling theoretical predictions for the 
distribution of lensing separations with observations, and explaining some of the features of the 
galaxy and cluster velocity function. 

The baryonic compression which helps support the rotation curves of galaxies also leads to an 
enormous increase in the cross section for the halos to be gravitational lenses, converting shallow 
NFW density profiles into profiles which begin to resemble the steep singular isothermal spheres 
preferred as models for gravitational lenses (Cohn et al. |200l| ). The sharp increase in the lens 
cross section produced by cooling the baryons leads us to expect a "break" in the separation 
distribution of gravitational lenses at the separation scale corresponding to the cooling mass scale. 



The existence of such a break was first discussed by Keeton ( 1998 ) to explain why the lenses found 
in systematic surveys were always associated with galaxies even when the galaxies were group or 
cluster members. A model incorporating an ad hoc break was fit to the distribution of lensing 



separations by Porciani & Madau (2000) and used to constrain the cooling scale to the range 
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We have shown that a simple, self-consistent model based on adiabatically cooled NFW 
profiles can fit the observed distribution of gravitational lens separations. In this model the lenses 
select a cooled baryon density 0.015 ^ f^6,cooi ^ 0.025 independent of the lens sample we fit or om' 
modeling of a galactic bulge. While our model is self-consistent, in the sense that it uses calculable 
properties of dark matter halos to convert from unmeasurable quantities to observable quantities, 
it is clearly simplistic at best. In particular we have neglected heating processes associated with 
star formation, leading to cold baryon fractions in groups and clusters which are too high and to 
an overestimate of the number density of low circular velocity galaxies or small separation lenses. 
It is gratifying that despite these shortcomings, the main features of the observations can still be 
understood in terms of this simple physics. 

When we predict the local distribution of halos in their observed circular velocity using the 
same model that fit the distribution of image separations, we find that our model also has a feature 
in the local velocity function at the velocity scale dividing galaxies from clusters. The comparison 
also makes it clear that additional physics (star formation, feedback and realistic treatment of 
the differences between disks and bulges) and significant improvements in the cooling model are 
necessary to make the agreement any better than qualitative, particularly at the low Vc end of the 
distribution (as has been noted by e.g. Gonzalez et al. 2000| ). 



The shortcomings of our model related to star formation and feedback could be improved 
by using full semi-analytic models. The density distribution on the other hand defies a simple 
treatment. Our "galaxies" still have very large cores, even with the bulges, and would likely 
predict observable central images, which are never seen in practice. To make our models more 
realistic we would need to have density distributions which are closer to isothermal spheres, which 
we would need to put in by hand as done in the semi-analytic models or the earlier lens studies of 
Keeton ( |l99^ ) and Porciani & Madau ( pOO0| ). 



Finally, when we explore the evolution of the velocity function of galaxies, we find that the 
effects of the baryons are non-negligible. It is only at extremely large radii that the "velocity" of a 
galaxy provides a measure of the mass uncontaminated by baryonic processes. On smaller scales 
the measured quantity is related to the desired one by complicated and poorly understood physics. 
Further, the shape of the observed velocity function at low-Vc suggests that either observational 
selection effects cause a drastic underestimate of low velocity systems or the efficiency of star 
formation is much lower in these systems than their higher velocity counterparts (or both). We do 
not currently understand how these effects would evolve with redshift. In addition, the amount of 
baryons, the fraction of the baryons which cool and its evolution, and the evolution in the average 
distribution of the baryons (e.g. bulge fractions) all produce changes in the velocity function that 
are at least as important as the evolution in the underlying mass function. 
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A. Simplifications to the Lens Calculations 

To estimate the lensing properties of the system we assume we view the halo face on to 
the disk and then compute the mass M{R) enclosed by the cylindrical radius R. By doing the 
calculations for a face-on disk we will underestimate the total lensing cross section (see Keeton 
& Kochanek |1998| ). The bending angle of the lens is then cto = 4GM(< R)/(?R and the lensed 



images are found as the solutions of 

u = — — ± ao{R) = X ± q(x) (A1) 

i->OL UqS 

where u is the angular source position, x = R/Dql is the angular position of the image, 
a{x) = {Dls / Dos)oio{DoLx) and Dql-, Dls and Dqs are angular diameter distances between 
the Observer, Lens and Source (see Schneider, Ehlers &: Falco |1992| ). 

As is true for any circular lens, the tangential critical line (Einstein ring) is at the solution x+ 
of 1 — a(x+) = and the radial critical line X- is at the solution of 1 — a'(x_) = 0. The radial 
caustic lies at n_ = jx_ — a(x_)| and the multiple image cross section is cr = vrn^. When the 
source is on the radial caustic, there are two images on the critical line at x_ and an additional 
solution at Xout such that the mean magnification produced by the lens in the multiply-imaged 
region is (M) = (xout/^*-)^- 

For a survey limited to sources brighter than flux Fmin, the probability of finding a lens is 

P(i^min) = j D^dD J dM^^ulB{Fm,n) (A2) 

where D^dD is the comoving volume element, dn/dM is the redshift and mass dependent halo 
density, vru^ is multiple imaging cross section, and B{Fynm) is the magnification bias factor. 
We require the magnification bias term even when we consider only the distribution of image 
separations because the average magnifications produced by the the models differ. We will use 
the Einstein ring diameter (2x+) to estimate the image separation produced by the lens, and 
we compute the integral distribution in separation by appropriately adjusting the limits of the 
integrals in Eq. ([A^). 

The uncertainties in the lensing properties of the halos are dominated by the assumptions 
and problems in deriving the mass distributions. These make estimates of the absolute lensing 
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probabilities problematic. Therefore we have focused only on the relative lensing probabilities, 
particularly the distribution of the lenses in image separation. Because the shapes of the potentials 
vary widely for the different lens models, we need to include an estimate of the variations in 
the magnification bias between the models rather than simply computing the multiple image 
cross sections. It is a generic feature of lens models that when the cross section is reduced by 
rearrangements in the structure of the lens, the mean magnification rises. Thus the resulting 
increase in the magnification bias makes the probability of finding the lens change less rapidly 
than the cross sections (see, e.g. Kochanek |1996 ). We need a simple model which can rapidly 



estimate the effects of magnification bias. Fortunately, the magnification bias of the radio lens 
surveys is modest because the slope of the luminosity function for radio sources fainter than the 
survey flux limit is flat and the estimates will be insensitive to simplifications in the calculation. 

We made two simplifications in the properties of the radio sources being lensed. First, we 
assumed the sources lay at a fixed redshift Zg = 2 rather than integrating over a distribution of lens 
redshifts. Second, at that redshift we used an approximate version of the pure luminosity evolution 
flat-spectrum radio luminosity function from Dunlop & Peacock ( |1990| ). For x = P/Pc{z), Dunlop 
& Peacock ( 199(]| ) used a luminosity function of the form dn/dlogx = (i/(x" + x^) with a = 0.83 



and {3 = 1.96. We approximated this form by a broken power-law, p = d/x" for x < 1 and 
d/x^ for X > 1, to simplify the calculations of magnification bias. The approximate luminosity 
function differs from the original only near the knee. The break in the luminosity function lies 
at log Pc{z) = ao + aiz + aiz^ ^ where ao = 25.26, a\ = 1.18 and 02 = —0.28 and P (in units of 
W Hz^i str"^) is related to the flux (in Jy) by P = FDl{l + 2)^+", a = is the average spectral 
index assumed for flat spectrum sources, and Dq/{1 + z) is the angular diameter distance in an 

= 1 cosmology. The density factor d will not enter our calculation, since we considered only a 
fixed source redshift, but it is dQ{dVo / dz) / {dV/ dz) where do is a constant density, dVo/dz is the 
comoving volume factor in an 0^ = 1 cosmology and (dV/dz) is the comoving volume factor in 
the cosmology we used for our calculation. These corrections are required to convert the Dunlop 
& Peacock ( 1990| ) model from the = 1 model in which it was derived to the cosmology we are 



using in our calculation. 

We then simplified the magnification bias calculation by assuming that the magnification 
probability distribution was P{> M) = (Mg/M)^ for M > Mq where we could estimate the 
minimum magnification from the mean magnification produced by the lens, 2Mo = (M). This 
approximation is exact for a SIS lens and the functional form is asymptotically correct for any 
lens dominated by fold caustics. By combining our approximate magnification distribution with 
our broken power law model model for the luminosity function, we can analytically compute the 
magnification bias for CLASS sample. 
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